Why Experimentation can be better than "Perfect Guidance"
نویسندگان
چکیده
Many problems correspond to the classical control task of determining the appropriate control action to take, given some (sequence of) observations. One standard approach to learning these control rules, called behavior cloning, involves watching a perfect operator operate a plant, and then trying to emulate its behavior. In the experimental learning approach, by contrast, the learner rst guesses an initial operation-to-action policy and tries it out. If this policy performs sub-optimally, the learner can modify it to produce a new policy, and recur. This paper discusses the relative eeectiveness of these two approaches, especially in the presence of perceptual aliasing, showing in particular that the experimental learner can often learn more eeectively than the cloning one.
منابع مشابه
Why Experimentation can be better than \ Perfect
Many problems correspond to the classical control task of determining the appropriate control action to take, given some (sequence of) observations. One standard approach to learning these control rules, called behavior cloning, involves watching a perfect operator operate a plant, and then trying to emulate its behavior. In the experimental learning approach, by contrast, the learner rst guess...
متن کاملDesigning Social Protection Programs: Using Theory and Experimentation to Understand how to Help Combat Poverty
“Anti-poverty” programs come in many varieties, ranging from multi-faceted, complex programs to more simple cash transfers. Articulating and understanding the root problem motivating government and nongovernmental organization intervention is critical for choosing amongst many anti-poverty policies, or combinations thereof. Policies should differ depending on whether the underlying problem is a...
متن کاملWhy Should We Have a Periodic Safety and Performance Program for Medical Devices
Nowadays, more than 10,000 different types of medical devices can be found in hospitals.These devices used in medical centers and hospitals for monitoring and treatment of patients require periodic safety and performance checking in order to have confidence in their functioning and operation. Physicians need better accurate medical measurements in order to better diagnose diseases, monitor pati...
متن کاملProbabilistic Infinite Secret Sharing
The study of probabilistic secret sharing schemes using arbitrary probability spaces and possibly infinite number of participants lets us investigate abstract properties of such schemes. It highlights important properties, explains why certain definitions work better than others, connects this topic to other branches of mathematics, and might yield new design paradigms. A probabilistic secret s...
متن کاملWhy we need to read and understand literature: literariness and Hans Rosling’s Factfulness (2018)
My article addresses the qualities of “good” literature and how an understanding of the nature of literary devices, so-called “literariness”, can enhance the reading experience. Focusing on Hans Rosling’s Factfulness (2018), I discuss some of the most important features of good writing. Six literary devices have been selected for special attention: point of view, tone, amplification, anecdotes,...
متن کامل